Estimation of a Proportion with Survey Data

نویسنده

  • Pierre Duchesne
چکیده

The estimation of proportions is a subject which cannot be circumvented in a first survey sampling course. Estimating the proportion of voters in favour of a political party, based on a political opinion survey, is just one concrete example of this procedure. However, another important issue in survey sampling concerns the proper use of auxiliary information, which typically comes from external sources, such as administrative records or past surveys. Very often, an efficient insertion of the auxiliary information available will improve the precision of the estimations of the mean or the total when a regression estimator is used. Conceptually, it is difficult to justify using a regression estimator for estimating proportions. A student might want to know how the estimation of proportions can be improved when auxiliary information is available. In this article, I present estimators for a proportion which use the logistic regression estimator. Based on logistic models, this estimator efficiently facilitates a good modelling of survey data. The paper’s second objective is to estimate a proportion using various sampling plans (such as a Bernoulli sampling and stratified designs). In survey sampling, each sample possesses its own probability and for a given unit, the inclusion probability denotes the probability that the sample will contain that particular unit. Bernoulli sampling may have an important pedagogical value, because students often have trouble with the concept of the inclusion probability. Stratified sampling plans may provide more insight and more precision. Some empirical results derived from applying four sampling plans to a real data base show that estimators of proportions may be made more efficient by the proper use of auxiliary information and that choosing a more satisfactory model may give additional precision. The paper also contains computer code written in S-Plus and a number of exercises.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling Nonnegative Data with Clumping at Zero: A Survey

Applications in which data take nonnegative values but have a substantial proportion of values at zero occur in many disciplines. The modeling of such “clumped-at-zero” or “zero-inflated” data is challenging. We survey models that have been proposed. We consider cases in which the response for the non-zero observations is continuous and in which it is discrete. For the continuous and then the d...

متن کامل

The Effect of Nonresponse Primary Sampling Units on Estimating the Variance of Changes by Jackknife Method (Case Study: Labor Force Survey Data for 2009 and 2010)

Abstract. According to the importance of presenting change estimation of labor force survey indicators along with their variance, in this paper, the use of Jackknife method in estimating variance of changes has been investigated. Then, the effect of nonresponse primary sampling units on estimating the variance of changes has been studied by use of Jackknife method via intensive simulation stud...

متن کامل

Estimation of the aerobic capacity by step test in the workers of a tile factory in Yazd in 2017

Object: Estimation of the Maximum Aerobic Capacity to Make a Physiological Proportion between Worker and the work is of calculation of the Maximum Aerobic Capacity is important to Make a Physiological Proportion between Worker and his work The purpose of this study is to estimate the highest aerobic capacity and Physical work capacityof tile and  ceramic workers . Analysis method :In this cros...

متن کامل

Large-scale Inversion of Magnetic Data Using Golub-Kahan Bidiagonalization with Truncated Generalized Cross Validation for Regularization Parameter Estimation

In this paper a fast method for large-scale sparse inversion of magnetic data is considered. The L1-norm stabilizer is used to generate models with sharp and distinct interfaces. To deal with the non-linearity introduced by the L1-norm, a model-space iteratively reweighted least squares algorithm is used. The original model matrix is factorized using the Golub-Kahan bidiagonalization that proje...

متن کامل

IMPLEMENTATION OF EXTENDED KALMAN FILTER TO REDUCE NON CYCLO-STATIONARY NOISE IN AERIAL GAMMA RAY SURVEY

Gamma-ray detection has an important role in the enhancement the nuclear safety and provides a proper environment for applications of nuclear radiation. To reduce the risk of exposure, aerial gamma survey is commonly used as an advantage of the distance between the detection system and the radiation sources. One of the most important issues in aerial gamma survey is the detection noise. Various...

متن کامل

A Bayesian Nominal Regression Model with Random Effects for Analysing Tehran Labor Force Survey Data

Large survey data are often accompanied by sampling weights that reflect the inequality probabilities for selecting samples in complex sampling. Sampling weights act as an expansion factor that, by scaling the subjects, turns the sample into a representative of the community. The quasi-maximum likelihood method is one of the approaches for considering sampling weights in the frequentist framewo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003